Competing with the Empirical Risk Minimizer in a Single Pass

نویسندگان

  • Roy Frostig
  • Rong Ge
  • Sham M. Kakade
  • Aaron Sidford
چکیده

In many estimation problems, e.g. linear and logistic regression, we wish to minimize an unknown objective given only unbiased samples of the objective function. Furthermore, we aim to achieve this using as few samples as possible. In the absence of computational constraints, the minimizer of a sample average of observed data – commonly referred to as either the empirical risk minimizer (ERM) or the M -estimator – is widely regarded as the estimation strategy of choice due to its desirable statistical convergence properties. Our goal in this work is to perform as well as the ERM, on every problem, while minimizing the use of computational resources such as running time and space usage. We provide a simple streaming algorithm which, under standard regularity assumptions on the underlying problem, enjoys the following properties: 1. The algorithm can be implemented in linear time with a single pass of the observed data, using space linear in the size of a single sample. 2. The algorithm achieves the same statistical rate of convergence as the empirical risk minimizer on every problem, even considering constant factors. 3. The algorithm’s performance depends on the initial error at a rate that decreases super-polynomially. 4. The algorithm is easily parallelizable. Moreover, we quantify the (finite-sample) rate at which the algorithm becomes competitive with the ERM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introduction to Competing Risk Model In the Epidemiological Research

Background and aims: Chronic kidney disease (CKD) is a public health challenge worldwide, with adverse consequences of kidney failure, cardiovascular disease (CVD), and premature death. Chronic kidney disease leads to the end stage of renal disease (ESRD), if late/not diagnosed. Competing risk modeling is a major issue in Epidemiology research. In Epidemiology study, sometimes inappropriate met...

متن کامل

Comparison of Random Survival Forests for Competing Risks and Regression Models in Determining Mortality Risk Factors in Breast Cancer Patients in Mahdieh Center, Hamedan, Iran

Introduction: Breast cancer is one of the most common cancers among women worldwide. Patients with cancer may die due to disease progression or other types of events. These different event types are called competing risks. This study aimed to determine the factors affecting the survival of patients with breast cancer using three different approaches: cause-specific hazards regression, subdistri...

متن کامل

Single DV-DXCCII Based Voltage Controlled First Order All-pass Filter with Inverting and Non-inverting responses

In this paper, a new voltage controlled first order all-pass filter is presented. The proposed circuit employs a single differential voltage dual-X second generation current conveyor (DV-DXCCII) and a grounded capacitor only. The proposed all-pass filter provides both inverting and non inverting voltage-mode outputs from the same configuration simultaneously without any matching condition. Non-...

متن کامل

بررسی عوامل خطر بر زمان بقای لوسمی حاد کودکان با استفاده از مدل ریسک رقابتی

Background and purpose: Acute leukemia is a malignancy with a rapidly progressive course of the lymphoid or myeloid cells. A prognostic factor for its survival and treatment is of particular importance in medical research. Therefore, this study aimed at applying competing risk model to compare clinical and pathologic factors in pediatric acute leukemia. Materials and methods: A historical coho...

متن کامل

شناسایی برخی عوامل خطر زمان تا مرگ سالمندان، با به‌کارگیری مدل آمیخته نیمه‌پارامتری مخاطرات رقیب تحلیل بقا

Objectives As the population of elderly people in Iran is rising, determining the risk factors of their death is necessary. The purpose of this study was to identify the risk factors that reduce the survival time of elderly people. Methods & Materials In a longitudinal retrospective study, data of 510 elderly people aged over 60 years, who were admitted to Kashan’s Golabchi nursing home ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015